Selective display of OCR&#39;ed text and corresponding images from publications on a client device

ABSTRACT

Text is extracted from a source image of a publication using an Optical Character Recognition (OCR) process. A document is generated containing text segments of the extracted text. The document includes a control module that responds to user interactions with the displayed document. Responsive to a user selection of a displayed text segment, a corresponding image segment from the source image containing the text is retrieved and rendered in place of the selected text segment. The user can select again to toggle the display back to the text segment. Each text segment can be tagged with a garbage score indicating its quality. If the garbage score of a text segment exceeds a threshold value, the corresponding image segment can be automatically displayed instead.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.12/366,547, filed on Feb. 5, 2009, entitled “Selective Display of OCR'edText and Corresponding Images from Publications in a Client Device,”which claims the benefit and priority of U.S. Provisional PatentApplication No. 61/147,901, filed on Jan. 28, 2009, entitled “Click toToggle OCR'd text with Image,” which are incorporated by referenceherein in their entirety.

BACKGROUND

1. Field of Disclosure

The disclosure generally relates to the field of optical characterrecognition (OCR), in particular to displaying text extracted using OCRand the original images from which the text was extracted.

2. Description of the Related Art

As more and more printed documents have been scanned and converted toeditable text using Optical Character Recognition (OCR) technology,people increasingly read such documents using computers. When reading adocument on a computer screen, users typically prefer the OCR'ed versionover the image version. Compared to the document image, the OCR'ed textis small in size and thus can be transmitted over a computer networkmore efficiently. The OCR'ed text is also editable (e.g., supports copyand paste) and searchable, and can be displayed clearly (e.g., using alocally available font) and flexibly (e.g., using a layout adjusted tothe computer screen), providing a better reading experience. The aboveadvantages are especially beneficial to those users who prefer to readon their mobile devices such as mobile phones and music players.

However, errors often exist in the OCR'ed text. Such errors may be dueto imperfections in the documents, artifacts introduced during thescanning process, and shortcomings of OCR engines. These errors caninterfere with use and enjoyment of the OCR'ed text and detract from theadvantages of such text. Therefore, there is a need for a way to realizethe benefits of using OCR'ed text while minimizing the impact of errorsintroduced by the OCR process.

SUMMARY

Embodiments of the present disclosure include a method (andcorresponding system and computer program product) for displaying textextracted from an image using OCR.

In one aspect, an OCR'ed document is generated for a collection ofOCR'ed text segments. Each text segment in the document is tagged withinformation that uniquely identifies a rectangular image segmentcontaining the text segment in the original text image in the sequenceof images from the original document. The document also contains programcode that enables a reader to toggle the display of a text segmentbetween the OCR'ed text and the corresponding image segment responsiveto a user selection.

In another aspect, a garbage score is calculated for each text segment.Each text segment in the OCR'ed document is tagged with its garbagescore. When the OCR'ed document is loaded, the embedded program codecompares the garbage score of each text segment with a threshold value.If the garbage score of a text segment is below the threshold, theprogram code displays the text segment. Otherwise, the program codedisplays the image segment in place of the text segment. The user cantoggle the display by selecting the text segment.

The features and advantages described in the specification are not allinclusive and, in particular, many additional features and advantageswill be apparent to one of ordinary skill in the art in view of thedrawings, specification, and claims. Moreover, it should be noted thatthe language used in the specification has been principally selected forreadability and instructional purposes, and may not have been selectedto delineate or circumscribe the disclosed subject matter.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a high-level block diagram of a computing environmentaccording to one embodiment of the present disclosure.

FIG. 2 is a high-level block diagram illustrating an example of acomputer for use in the computing environment shown in FIG. 1 accordingto one embodiment of the present disclosure.

FIG. 3 is a high-level block diagram illustrating modules within adocument serving system according to one embodiment of the presentdisclosure.

FIG. 4 is a flow diagram that illustrates the operation of the documentserving system according to one embodiment of the present disclosure.

FIG. 5 is a flow diagram that illustrates the operation of a controlmodule generated by the document serving system according to oneembodiment of the present disclosure.

FIGS. 6A-6C are screenshots illustrating a user experience of reading aweb page generated by the document serving system according to oneembodiment of the present disclosure.

DETAILED DESCRIPTION

The computing environment described herein enables readers of OCR'edtext to conveniently toggle a display between a segment of OCR'ed textand a segment of the source image containing the text segment.

The Figures (FIGS.) and the following description describe certainembodiments by way of illustration only. One skilled in the art willreadily recognize from the following description that alternativeembodiments of the structures and methods illustrated herein may beemployed without departing from the principles described herein.Reference will now be made in detail to several embodiments, examples ofwhich are illustrated in the accompanying figures. It is noted thatwherever practicable similar or like reference numbers may be used inthe figures and may indicate similar or like functionality.

System Environment

FIG. 1 is a high-level block diagram that illustrates a computingenvironment 100 for converting printed publications into OCR'ed text andallowing readers to view the OCR'ed text and corresponding source imageas desired, according to one embodiment of the present disclosure. Asshown, the computing environment 100 includes a scanner 110, an OCRengine 120, a document serving system 130, and a client device 140. Onlyone of each entity is illustrated in order to simplify and clarify thepresent description. There can be other entities in the computingenvironment 100 as well. In some embodiment, the OCR engine 120 and thedocument serving system 130 are combined into a single entity.

The scanner 110 is a hardware device configured to optically scanprinted publications (e.g., books, newspapers) and convert the printedpublications to digital text images. The output of the scanner 110 isfed into the OCR engine 120.

The OCR engine 120 is a hardware device and/or software programconfigured to convert (or translate) source images into editable text(hereinafter called OCR'ed text). The OCR engine 120 processes thesource images using computer algorithms and generates correspondingOCR'ed text.

In addition, the OCR engine 120 generates and outputs positionalinformation describing the image segments containing the OCR'ed text inthe source images. For example, for each segment of text (e.g.,paragraph, column, title), the OCR engine 120 provides a set of valuesdescribing a bounding box that uniquely specifies the segment of thesource image containing the text segment. The values describing thebounding box include two-dimensional coordinates of the top-left cornerof a rectangle on an x-axis and a y-axis, and a width and a height ofthe rectangle. Therefore, the bounding box uniquely identifies a regionof the source image as the image segment corresponding to the textsegment. In other embodiments the bounding box can specify imagesegments using shapes other than rectangle.

The OCR engine 120 may also generate a confidence level that measures aquality of the OCR'ed text. In addition, the OCR engine 120 may generateother information such as format information (e.g., font, font size,style). Examples of the OCR engine 120 include ABBYY FineReader OCR,ADOBE Acrobat Capture, and MICROSOFT Office Document Imaging. The outputof the OCR engine 120 is fed into the document serving system 130.

The document serving system 130 is a computer system configured toprovide electronic representations of the printed publications to users.The document serving system 130 stores information received from the OCRengine 120 including the OCR'ed text, the source images, the positionalinformation relating segments of the OCR'ed text to segments of thesource images, and the confidence levels. In one embodiment, thedocument serving system 130 uses the received information to calculate a“garbage score” for each text segment of the OCR'ed text that measuresits overall quality. In addition, the document serving system 130includes a control module 132 that can be executed by client devices140. The control module 132 allows users of the client devices 140 toselectively toggle display of a text segment and the corresponding imagesegment, thereby allowing the user to view either the OCR'ed text or theportion of the source image of the printed publication from which thetext was generated.

In one embodiment, the document serving system 130 provides a websitefor users to read OCR'ed printed publications as web pages using clientdevices 140. Upon receiving a request from a client device for aparticular portion of a printed publication, the document serving system130 generates a document (e.g., a web page) containing the requestedportion of the publication. In one embodiment, the document includes thetext segments in the requested portion of the publication (e.g., thetext for a chapter of a book). In addition, the document includes thepositional information relating the text segments to the correspondingimage segments, and the garbage scores for the text segments. Thedocument also includes the control module 132. The document servingsystem 130 provides the generated document to the requesting clientdevice 140.

The client device 140 is a computer system configured to requestdocuments from the document serving system 130 and display the documentsreceived in response. This functionality can be provided by a readingapplication 142 such as a web browser (e.g., Microsoft InternetExplorer™, Mozilla Firefox™, and Apple Safari™) executing on the clientdevice 140. The reading application 142 executes the control module 132included in the document received from the document serving system 130,which in turn allows the user to toggle the portions of the documentbetween display of the text segment and display of the correspondingimage segment.

The scanner 110 is communicatively connected with the OCR engine 120;the OCR engine 120 is communicatively connected with the documentserving system 130; and the document serving system 130 iscommunicatively connected with the client device 140. Any of theconnections may be through a wired or wireless network. Examples of thenetwork include the Internet, an intranet, a WiFi network, a WiMAXnetwork, a mobile telephone network, or a combination thereof.

Computer Architecture

The entities shown in FIG. 1 are implemented using one or morecomputers. FIG. 2 is a high-level block diagram illustrating an examplecomputer 200. The computer 200 includes at least one processor 202coupled to a chipset 204. The chipset 204 includes a memory controllerhub 220 and an input/output (I/O) controller hub 222. A memory 206 and agraphics adapter 212 are coupled to the memory controller hub 220, and adisplay 218 is coupled to the graphics adapter 212. A storage device208, keyboard 210, pointing device 214, and network adapter 216 arecoupled to the I/O controller hub 222. Other embodiments of the computer200 have different architectures.

The storage device 208 is a computer-readable storage medium such as ahard drive, compact disk read-only memory (CD-ROM), DVD, or asolid-state memory device. The memory 206 holds instructions and dataused by the processor 202. The pointing device 214 is a mouse, trackball, or other type of pointing device, and is used in combination withthe keyboard 210 to input data into the computer system 200. Thegraphics adapter 212 displays images and other information on thedisplay 218. The network adapter 216 couples the computer system 200 toone or more computer networks.

The computer 200 is adapted to execute computer program modules forproviding functionality described herein. As used herein, the term“module” refers to computer program logic used to provide the specifiedfunctionality. Thus, a module can be implemented in hardware, firmware,and/or software. In one embodiment, program modules are stored on thestorage device 208, loaded into the memory 206, and executed by theprocessor 202.

The types of computers 200 used by the entities of FIG. 1 can varydepending upon the embodiment and the processing power required by theentity. For example, the document serving system 130 might comprisemultiple blade servers working together to provide the functionalitydescribed herein. As another example, the client device 140 mightcomprise a mobile telephone with limited processing power. The computers200 can lack some of the components described above, such as keyboards210, graphics adapters 212, and displays 218.

Example Architectural Overview of the Document Serving System

FIG. 3 is a high-level block diagram illustrating a detailed view ofmodules within the document serving system 130 according to oneembodiment. Some embodiments of the document serving system 130 havedifferent and/or other modules than the ones described herein.Similarly, the functions can be distributed among the modules inaccordance with other embodiments in a different manner than isdescribed here. As illustrated, the document serving system 130 includesa text evaluation engine 310, a code generation module 320, a documentgeneration module 330, an Input/Output management module (hereinaftercalled the I/O module) 340, and a data store 350.

The text evaluation engine 310 generates garbage scores for textsegments based on information provided by the OCR engine 120. Thegarbage score is a numeric value that measures an overall quality of thetext segment. In one embodiment, the garbage score ranges between 0 and100, with 0 indicating high text quality and 100 indicating low textquality.

To generate the garbage score, an embodiment of the text evaluationengine 310 generates a set of language-conditional characterprobabilities for each character in a text segment. Eachlanguage-conditional character probability indicates how well thecharacter and a set of characters that precede the character in the textsegment concord with a language model. The set of characters thatprecede the character is typically limited to a small number (e.g. 4-8characters) such that characters in compound words and other joint wordsare given strong probability values based on the model. Thelanguage-conditional character probabilities may be combined with otherindicators of text quality (e.g., the confidence levels provided by theOCR engine 120) to generate a text quality score for each character inthe text segment. The calculation of such a value allows for thelocation-specific analysis of text quality.

The text evaluation engine 310 combines the set of text quality scoresassociated with the characters in a text segment to generate a garbagescore that characterizes the quality of the text segment. The textevaluation engine 310 may average the text quality scores associatedwith the characters in the text segment to generate the garbage score.

The code generation module 320 generates or otherwise provides thecontrol module 132 that controls display of the document on the clientdevice 140. In one embodiment, the control module 132 is implementedusing browser-executable code written using a programming language suchas JAVASCRIPT, JAVA, or Perl. The code generation module 320 can includeor communicate with an application such as the Google Web Toolkit and/orprovide an integrated development environment (IDE) allowing developersto develop the control module 132. Depending upon the embodiment, thecode generation module 320 can store a pre-created instance of thecontrol module 132 that can be included in documents provided to theclient devices 140 or can form a control module 132 in real-time asclient devices 140 request documents from the document serving system130.

The document generation module 330 generates the documents providingportions of the publications to the requesting client devices 140. Inone embodiment, the generated documents are web pages formed using theHypertext Markup Language (HTML). Other embodiments generate documentsthat are not web pages, such as documents in the Portable DocumentFormat (PDF), and/or web pages formed using languages other than HTML.

To generate a document, the document generation module 330 identifiesthe publication and portion being requested by a client device 140, andretrieves the text segments constituting that portion from the datastore 350. The document generation module 330 creates a document havingthe text segments, and also tags each text segment in the document withthe positional information relating the text segment to thecorresponding image segment from the source image. The documentgeneration module 330 also tags each text segment with its associatedgarbage score. In addition, the document generation module 330 embedsthe control module 132 provided by the code generation module 320 in thedocument. The document generation module 330 may generate the documentwhen the OCR'ed text becomes available. Alternatively, the documentgeneration module 330 may dynamically generate the document on demand(e.g., upon request from the client device 140).

The I/O module 340 manages inputs and outputs of the document servingsystem 130. For example, the I/O module 340 stores data received fromthe OCR engine 120 in the data store 350 and activates the textevaluation engine 310 to generate corresponding garbage scores. Asanother example, the I/O module 340 receives requests from the clientdevice 140 and activates the document generation module 330 to providethe requested documents in response. If the document serving systemreceives a request for an image segment, the I/O module 340 retrievesthe image segment from the data store 350 and provides it to the clientdevice 140. In one embodiment, the I/O module 340 processes the imagesegment before returning it to the client device 140. For example, theI/O module 340 may adjust a size and/or a resolution of the imagesegment based on a resolution of the screen of the client device 140displaying the document.

The data store 350 stores data used by the document serving system 130.Examples of such data include the OCR'ed text and associated information(e.g., garbage scores, positional information), source images, andgenerated documents. The data store 350 may be a relational database orany other type of database.

Document and Control Module

According to one embodiment, the document serving system 130 generatesdocuments with embedded control modules 132. A document contains textsegments tagged with information for identifying the corresponding imagesegments. The text segments are also tagged with format informationdesigned to imitate the original text in the source image. Such formatinformation includes font, font size, and style (e.g., italic, bold,underline).

An embodiment of the control module 132 includes event handlers thathandle events related to the document. For example, responsive to thedocument being loaded into a web browser at a client device 140 (anon-load event), the control module 132 generates a display of theincluded text segments using HTML text tags. As another example,responsive to a user selection of a text segment, the control module 132toggles the display between the text segment and the corresponding imagesegment.

In one embodiment, when the web page is loaded by a web browser, theembedded control module compares the garbage score of each text segmentwith a threshold value to determine whether the text segment is ofsufficient quality for display. If the garbage score equals or is belowthe threshold value, the control module displays the text segment usingHTML code such as the following:

<p id=‘pageID.40.paraID.1.box.103.454.696.70.garbage.40’> <i>Thecourtyard of the Sheriff&#39;s house. A chapel. A shed in which is ablacksmith&#39;s forge with fire. A prison near which is an anvil,before which Will Scarlet is at work making a sword.</i></p>The above HTML code includes the following text in italic style: “Thecourtyard of the Sheriff's house. A chapel. A shed in which is ablacksmith's forge with fire. A prison near which is an anvil, beforewhich Will Scarlet is at work making a sword.” The paragraph is taggedwith the following information “id=′pageID.40.paraID.1.box.103.454.696.70.garbage.40′”, indicating that thecorresponding image segment is located on page 40 (pageID.40), paragraph1 (paraID.1), that the top-left corner of the image segment is locatedat (103, 454), that the image segment is 696 pixels in height and 70pixels in length, and that the associated garbage score is 40(garbage.40).

If the garbage score exceeds the threshold value, the control module 132automatically retrieves the image segment and displays the image segmentinstead of the text segment using HTML code such as the following:

<p id=‘pageID.40.paraID.1.box.103.454.696.70. garbage.40’><imgsrc=“image?bookID=0123&pageID=40&paraID=1&x=103&y=454&h= 696&w=70”display=“100%”></p>The above HTML code retrieves an image segment containing the same textas the paragraph above, and displays the image segment in place of thetext segment. It is noted that the bookID can be hardcoded in thedocument by the document generation module 330. The threshold value canbe set by a user or preset in the document.

A user can also specify whether the document displays a text segment oran image segment. For example, the user can use a keyboard or a pointingdevice to activate the text segment, or tap the text segment on a touchsensitive screen. Responsive to a user selection, the control module 132dynamically toggles the display between the text segment and thecorresponding image segment. When the display is toggled from the textsegment to the image segment, the control module 132 requests the imagesegment from the document serving system 130 with information thatuniquely identifies the image segment (e.g., page number, paragraphnumber, binding box), inserts an image tag of the image segment into theweb page, and renders the image segment to the user in place of the OCR'ed text. Even though it is not displayed, the text segment is stored ina local variable such that when the user toggles back, the correspondingtext can be readily displayed.

Typically when an image segment is displayed, the control module 132configures the display to be 100%, indicating that the image should beresized to fill up the whole width of the screen. However, when a textsegment (e.g., a short utterance or a title line such as “Chapter One”)is very short (e.g., less than 50% of a line), the control module can beconfigured to display the image as a similar percentage of the screenwidth.

Overview of Methodology for the Document Serving System

FIG. 4 is a flowchart illustrating a method 400 for the document servingsystem 130 to interactively provide a document to a client device 140for viewing by a user according to one embodiment. Other embodiments canperform the steps of the method 400 in different orders. Moreover, otherembodiments can include different and/or additional steps than the onesdescribed herein. The document serving system 130 can perform multipleinstances of the steps of the method 400 concurrently and/or inparallel.

Initially, the document serving system 130 receives 410 the OCR'ed text,source images, and associated information (e.g., positional information,confidence levels) from the OCR engine 120. The document serving system130 calculates 420 a garbage score for each OCR'ed text segment (e.g.,through the text evaluation engine 310), and generates 430 a controlmodule 132 to be included in a document (e.g., through the codegeneration module 320).

The document serving system 130 receives 440 a request from a clientdevice 140 for a portion of a publication (e.g., a chapter of a book),retrieves the text segments constituting the requested portion from thedata store 350, and generates 450 a document such as a web pageincluding the text segments. The text segments are tagged with relatedattributes including the positional information and the garbage scores.The generated document also includes the control module 132. Thedocument serving system 130 transmits 460 the generated document to theclient device 140 that requested it.

As described above, the user can interact with the document to view theimage segment instead of the corresponding text segment. When thecontrol module 132 executing at the client device 140 receives a requestto display an image segment, it transmits an image request withparameters uniquely identifying the image segment to the documentserving system 130. The document serving system 130 receives 470 theimage request, retrieves 480 the requested image segment, and transmits490 it to the client device 140 for display. The image request mayprovide additional information such as a resolution of the screendisplaying the document. The document serving system 130 may process theimage segment (e.g., resize, adjust resolution) based on suchinformation before transmitting 490 the processed image segment to theclient device 140 for display.

Overview of Methodology for the Control Module

FIG. 5 is a flow diagram that illustrates an operation 500 of thecontrol module 132 included in a document according to one embodiment.The control module 132 is executed by the reading application 142 (e.g.,a web browser) at the client device 140 when the document is displayedby the application. In an alternative embodiment, the functionality ofthe control module 132 is provided by the reading application 142 itself(e.g., by a plug-in applet). Thus, the control module 132 needs not beincluded in the document sent by the document serving system 130 to theclient device 140.

As shown, when the document is loaded, the control module 132 generates510 a display of the document. As described above, the control module132 compares the garbage score of each text segment with a thresholdvalue to determine whether to display the text segment or thecorresponding image segment.

The control module 132 monitors for and detects 520 a user selection ofa displayed segment. The control module 132 determines 530 whether theselected segment is currently displayed as a text segment or as an imagesegment. If the displayed segment is a text segment, the control module132 requests 540 the corresponding image segment, receives 550 therequested image segment, and displays 560 the received image segment inplace of the text segment. Otherwise, the control module 132 replaces570 the image tag for the image segment with the text segment. In oneembodiment, the control module 132 stores the undisplayed text segmentslocally in the document (e.g., in local JavaScript variables), such thatit does not need to request and retrieve the text segment from thedocument serving system 130 when the user toggles the display back totext. After the display switch, the control module 132 resumesmonitoring for user selections.

Example

FIGS. 6A-6C are screenshots illustrating a user experience ofinteracting with a document according to one embodiment of the presentdisclosure. In this example, the document is a web page. As shown inFIG. 6A, a user retrieves a web page generated for an OCR'ed book titled“A Christmas Carol: Being a Ghost of Christmas Past” using an APPLEiPHONE client. The web page contains pages 120-130 of the book.

The user desires to view the image segment for a paragraph 610, and tapsthe display of the paragraph. In response, the control module 132replaces the text segment for the paragraph 610 with an interstitialimage 620, as shown in FIG. 6B. The interstitial image 620 shows thetext “Loading original book image . . . (Tap on the image to go back toprevious view)”. The interstitial image 620 is designed to help theusers understand the action as well as provide a clear guidance on howto get back. For example, if the network connection of the client device140 is poor, it may take a while to load the original image segmentcontaining paragraph 610. The user can tap the interstitial image 620 tocancel the action and resume viewing the text segment. The interstitialimage 620 also helps with reducing the perceived loading time.

When the image segment 630 is retrieved, the control module 132 swaps inthe image segment 630 to replace the text segment, as shown in FIG. 6C.The user can then tap again to revert to the text segment as shown inFIG. 6A.

Some portions of above description describe the embodiments in terms ofalgorithmic processes or operations. These algorithmic descriptions andrepresentations are commonly used by those skilled in the dataprocessing arts to convey the substance of their work effectively toothers skilled in the art. These operations, while describedfunctionally, computationally, or logically, are understood to beimplemented by computer programs comprising instructions for executionby a processor or equivalent electrical circuits, microcode, or thelike. Furthermore, it has also proven convenient at times, to refer tothese arrangements of functional operations as modules, without loss ofgenerality. The described operations and their associated modules may beembodied in software, firmware, hardware, or any combinations thereof.

As used herein any reference to “one embodiment” or “an embodiment”means that a particular element, feature, structure, or characteristicdescribed in connection with the embodiment is included in at least oneembodiment. The appearances of the phrase “in one embodiment” in variousplaces in the specification are not necessarily all referring to thesame embodiment.

Some embodiments may be described using the expression “coupled” and“connected” along with their derivatives. It should be understood thatthese terms are not intended as synonyms for each other. For example,some embodiments may be described using the term “connected” to indicatethat two or more elements are in direct physical or electrical contactwith each other. In another example, some embodiments may be describedusing the term “coupled” to indicate that two or more elements are indirect physical or electrical contact. The term “coupled,” however, mayalso mean that two or more elements are not in direct contact with eachother, but yet still co-operate or interact with each other. Theembodiments are not limited in this context.

As used herein, the terms “comprises,” “comprising,” “includes,”“including,” “has,” “having” or any other variation thereof, areintended to cover a non-exclusive inclusion. For example, a process,method, article, or apparatus that comprises a list of elements is notnecessarily limited to only those elements but may include otherelements not expressly listed or inherent to such process, method,article, or apparatus. Further, unless expressly stated to the contrary,“or” refers to an inclusive or and not to an exclusive or. For example,a condition A or B is satisfied by any one of the following: A is true(or present) and B is false (or not present), A is false (or notpresent) and B is true (or present), and both A and B are true (orpresent).

In addition, use of the “a” or “an” are employed to describe elementsand components of the embodiments herein. This is done merely forconvenience and to give a general sense of the disclosure. Thisdescription should be read to include one or at least one and thesingular also includes the plural unless it is obvious that it is meantotherwise.

Upon reading this disclosure, those of skill in the art will appreciatestill additional alternative structural and functional designs for asystem and a process for displaying OCR'ed text. Thus, while particularembodiments and applications have been illustrated and described, it isto be understood that the present invention is not limited to theprecise construction and components disclosed herein and that variousmodifications, changes and variations which will be apparent to thoseskilled in the art may be made in the arrangement, operation and detailsof the method and apparatus disclosed herein without departing from thespirit and scope as defined in the appended claims.

What is claimed is:
 1. A computer-implemented method for displaying apublication, comprising: receiving a document including a plurality oftext segments generated responsive to an Optical Character Recognition(OCR) process performed on a source image of the publication, whereineach text segment is associated with a garbage score indicating aquality of the text segment; comparing the garbage score of a particulartext segment of the plurality of text segments with a threshold value;responsive to the garbage score meeting the threshold value, displayingthe particular text segment on a display of a client device; andresponsive to the garbage score not meeting the threshold value,displaying an image segment from which the particular text segment wasgenerated via the OCR process instead of the particular text segment onthe display of the client device.
 2. The computer-implemented method ofclaim 1, further comprising: responsive to a selection of the imagesegment by a user, replacing the selected image segment with theparticular text segment on the display of the client device.
 3. Thecomputer-implemented method of claim 1, wherein the particular textsegment is replaced with the image segment from which the particulartext segment was generated via the OCR process on the display of theclient device responsive to a selection of the particular text segmentby a user.
 4. The computer-implemented method of claim 1, wherein thedocument includes positional information relating the included textsegments to image segments of the source image, wherein displaying theimage segment further comprises: identifying positional information inthe document for the particular text segment; and transmitting a requestfor the image segment, the request including the identified positionalinformation.
 5. The computer-implemented method of claim 4, wherein therequest for the image segment is transmitted to a remote server and theimage segment is retrieved from the remote server.
 6. Thecomputer-implemented method of claim 4, wherein the positionalinformation describes a region of the source image showing textcontained in the particular text segment.
 7. The computer-implementedmethod of claim 1, wherein displaying the image segment comprises:replacing the particular text segment with the image segment from whichthe particular text segment was generated via the OCR process in thedocument.
 8. A non-transitory computer-readable storage medium encodedwith executable computer program code for displaying a publication, thecomputer program code comprising program code for: receiving a documentincluding a plurality of text segments generated responsive to anOptical Character Recognition (OCR) process performed on a source imageof the publication, wherein each text segment is associated with agarbage score indicating a quality of the text segment; comparing thegarbage score of a particular text segment with a threshold value;responsive to the garbage score meeting the threshold value, displayingthe particular text segment on a display of a client device; andresponsive to the garbage score not meeting the threshold value,displaying an image segment from which the particular text segment wasgenerated via the OCR process instead of the particular text segment onthe display of the client device.
 9. The non-transitorycomputer-readable storage medium of claim 8, further comprising programcode for: responsive to a selection of the image segment by a user,replacing the selected image segment with the particular text segment onthe display of the client device.
 10. The non-transitorycomputer-readable storage medium of claim 8, wherein the particular textsegment is replaced with the image segment from which the particulartext segment was generated via the OCR process on the display of theclient device responsive to a selection of the particular text segmentby a user.
 11. The non-transitory computer-readable storage medium ofclaim 8, wherein the document includes positional information relatingthe included text segments to image segments of the source image, andwherein the program code for displaying the image segment comprisesprogram code for: identifying positional information in the document forthe particular text segment; and transmitting a request for the imagesegment, the request including the identified positional information.12. A computer-implemented method for providing a publication to a user,comprising: receiving a document including text segments generatedresponsive to an Optical Character Recognition (OCR) process performedon a source image of the publication, wherein the text segments areassociated with a plurality of garbage scores indicating quality levelsof the text segments, and wherein the document also includes positionalinformation specifying segments of the source image from which the textsegments were generated via the OCR process; providing the document to aclient device; and responsive to receiving a request from the clientdevice including positional information for an image segment from whicha particular text segment included in the document was generated via theOCR process, automatically providing the image segment to the clientdevice, wherein the client device is adapted to display the imagesegment based on a garbage score from the plurality of garbage scorescorresponding to the particular text segment.
 13. Thecomputer-implemented method of claim 12, further comprising: providingthe plurality of garbage scores to the client device.
 14. Thecomputer-implemented method of claim 12, further comprising generating agarbage score for an associated text segment, wherein generating thegarbage score comprises: calculating a set of language-conditionalcharacter probabilities for each character in the associated textsegment based on a language model; and generating the garbage score forthe associated text segment based on the set of language-conditionalcharacter probabilities for the characters in the associated textsegment.
 15. The computer-implemented method of claim 12, furthercomprising: including a control module in the document, the controlmodule adapted to execute on the client device to control display of thedocument on the client device and to automatically generate the requestfor an image segment from which the particular text segment wasgenerated via the OCR process.
 16. The computer-implemented method ofclaim 12, wherein the positional information describes a region of thesource image showing text contained in the particular text segment. 17.A non-transitory computer-readable storage medium encoded withexecutable computer program code for providing a publication to a user,the computer program code comprising program code for: receiving adocument including text segments generated responsive to an OpticalCharacter Recognition (OCR) process performed on a source image of thepublication, wherein the text segments are associated with a pluralityof garbage scores indicating quality levels of the text segments, andwherein the document also includes positional information specifyingsegments of the source image from which the text segments were generatedvia the OCR process; providing the document to a client device; andresponsive to receiving a request from the client device includingpositional information for an image segment from which a particular textsegment included in the document was generated via the OCR process,automatically providing the image segment to the client device, whereinthe client device is adapted to display the image segment based on agarbage score from the plurality of garbage scores corresponding to theparticular text segment.
 18. The non-transitory computer-readablestorage medium of claim 17, wherein the program code further comprisesprogram code for: providing the plurality of garbage scores to theclient device.
 19. The non-transitory computer-readable storage mediumof claim 17, wherein the program code further comprises program code forgenerating a garbage score for an associated text segment, the programcode comprising code for: calculating a set of language-conditionalcharacter probabilities for each character in the associated textsegment based on a language model; and generating the garbage score forthe associated text segment based on the set of language-conditionalcharacter probabilities for the characters in the associated textsegment.
 20. The non-transitory computer-readable storage medium ofclaim 17, further comprising program code for: including a controlmodule in the document, the control module adapted to execute on theclient device to control display of the document on the client deviceand to automatically generate the request for an image segment fromwhich the particular text segment was generated via the OCR process.